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Abstract 

We describe a robust approach for linking al- 
ready existing lexical/semantic hierarchies. We 
use a constraint satisfaction algorithm (relax- 



tioa labelling) to spWt. -amnng a set of_ 



candidates- the node in a target taxonomy that 
bests matches each node in a source taxonomy. 
In this paper we present the complete mapping 
of the nominal, verbal, adjectival and adverbial 
parts of WordNet 1.5 onto WordNet 1.6. 

1 Introduction 

There is an increasing need of having available 
general, accurate and broad coverage multilin- 
gual lexical/semantic resources for developing 
NL applications. Thus, a very active field in- 
side nl during the last years has been the fast 
development of generic language resources. 

Several attempts have been performed to 
connect already existing ontologies. For in- 
stance, to (semi) automatically link Spanish tax- 
onomies extracted from dgile (Alvar, 1987) 



to their English analogous from ldoce (Ageno 
et al., 199^ ), or to WordNet flMiller et al. 



1991 ) synsets flRigau et al., 1995| ) ; the co nstrue 
tion of large multilingual lexicons as in ([Knight| 
and Luk, 1994; Okumura and Hovy, 1994|) ; or 
the alignment of edr and WordNet ontologies 



(Utiyama and Hasida, 1997). Also, several lex- 
ical resources and techniques are combined in 
(Atserias et al., 1997; Farreres et al., 1998; ) to 
map Spanish words from a bilingual dictionary 
to WordNet . 

The use of relaxation labelling algorithm to 
attach substantial fragments of the Spanish tax- 
onomy derived from dgile flRigau et al., 1998 ) 
to the English WordNet has been reported in 



(Daude et al., 1999). The mapping of the nom- 
inal parts of wn1.5 to wn1.6 is presented in 
flDaude et al., 200q ). 



In this paper we use the same approach to 
map the verbal, adjectival and adverbial parts 
of WNl.5 to WNl.6, and present some improve- 
ments on the results presented in flDaude et al 



2000 ). We provide evaluation of the mapping 



accuracy by means of hand checking a random 
sample of the taxonomies. 

This paper is organized as follows: In sec- 
tion [2] we describe the used technique (the re- 
laxation labelling algorithm) and its application 
to hierarchy mapping. In section || we describe 
the constraints used in the relaxation process, 
and finally, section || details performed tests and 
evaluation of achieved results. 

2 Application of Relaxation 
Labelling to nlp 

Relaxation labelling (rl) is a generic name for 
a family of iterative algorithms which perform 
function optimization, based on local informa- 
tion, but with global effects. See ( [Torras, 1989 ) 
for a summary. Its most remarkable feature is 
that it can deal with any kind of constraints, 
and the algorithm is independent of the com- 
plexity of the model. 

The algorithm has been applied to POS tag- 



ging (Mar quez and Padro, 1997 h shallow pars- 
ing ( Voutilainen and Padro, 1997|) and to word 
sense disambiguation ( Padro, 1998| ). 



2.1 Algorithm Description 

The Relaxation Labelling algorithm deals with 
a set of variables (which may represent words, 
synsets, etc.), each of which may take one 
among several different labels (pos tags, senses, 
mrd entries, etc.). There is also a set of con- 
straints which state compatibility or incompat- 
ibility of a combination of pairs variable-label. 

The aim of the algorithm is to find a weight 
assignment for each possible label for each vari- 



able, such that (a) the weights for the labels of 
the same variable add up to one, and (b) the 
weight assignment satisfies -to the maximum 
possible extent- the set of constraints. 

Summarizing, the algorithm performs con- 
straint satisfaction to solve a consistent la- 
belling problem. The followed steps are: 

1. Start with a random weight assignment. 

2. Compute the support value for each label 
of each variable. Support is computed ac- 
cording to the constraint set and to the cur- 
rent weights for labels belonging to context 
variables. 

3. Increase the weights of the labels more 
compatible with the context (larger sup- 
port) and decrease those of the less com- 
patible labels (smaller support). Weights 
are changed proportionally to the support 
received from the context. 

4. If a stopping/convergence criterion is sat- 
isfied, stop, otherwise go to step 2. We 
use the criterion of stopping when there are 
no more changes, although stopping crite- 
ria may range from simply a fixed number 
of iterations to more sophisticated heuristic 



procedures flEkhmdh and Rosenfeld, 1978|) . 



2.2 Application to taxonomy mapping 

As described in previous sections, the problem 
we are dealing with is to map two taxonomies. 
In this particular case, we are interested in map- 
ping WNl.5 to WNl.6, that is, assign each synset 
in the former to at least one synset in the later. 
The modeling of the problem is the following: 

• Each wn1.5 synset is a variable for the re- 
laxation algorithm. We will refer to it as 
source synset and to WNl.5 as source tax- 
onomy. 

• The possible labels for that variable are all 
the wn1.6 synsets which contain a word be- 
longing to the source synset. We will refer 
to them as target synsets and to WNl.6 as 
target taxonomy. 

• The algorithm will need constraints stating 
whether a WNl.6 synset is a suitable assign- 
ment for a WNl.5 synset. As described in 
section [| these constraints will rely mainly 
on the taxonomy structure. 



3 The Constraints 

Constraints are used by the relaxation labelling 
algorithm to increase or decrease the weight for 
a variable label. In our case, constraints in- 
crease the weights for the connections between 
a source synset and a target synset. Increasing 
the weight for a connection implies decreasing 
the weights for all the other possible connec- 
tions for the same source synset. To increase the 
weight for a connection, constraints take into 
account already connected nodes that have the 
same relationships in both taxonomies. 

3.1 Constraints for wn Nouns 

Although there is a wide range of relationships 
between WordNet synsets which can be used 
to build constraints, in ( paude et al., 200C| ) we 
focus on the hyper/hyponym relationships, in- 
creasing the weight for a connection when the 
involved nodes have hypernyms/hyponyms also 
connected. Hyper/hyponym relationships are 
considered either directly or indirectly (i.e. an- 
cestors or descendants), depending on the kind 
of constraint used. 




Figure 1: Example of connections between tax- 
onomies. 

Figure [l] shows an example of possible con- 
nections between two taxonomies. Connection 
C4 will have its weight increased due to C5, Cq 
and Ci, while connections C2 and C3 will have 
their weights decreased. 

In (paude et al., 2000[) we distinguish differ- 
ent kinds of constraints, depending on whether 
hyponyms, hypernyms or both are taken into 
account, on whether those relationships are con- 
sidered direct or indirect, and on which of both 
taxonomies recursion is allowed. Each con- 
straint may be used alone or combined with oth- 
ers. 



Thus, the constraints used in ( paude et al. 
2000D are: 



11 constraints. This group includes he, iio 
and iib constraints. All of them match an 
immediate-to-immediate relationship be- 
tween both taxonomies, as shown in fig- 
ure]^, where the arrows indicate an immedi- 
ate hypernymy relationship. The nodes on 
the left hand side correspond to the source 
taxonomy and the nodes on the right to 
the target hierarchy. The dotted line is the 
connection which weight will be increased 
due to the existence of the connection(s) 
indicated with a continuous line. 
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Figure 2: 11 constraints. 

Al constraints. This constraint group includes 
AlE, AIO and AlB. That is, constraints that 
allow recursion on the source taxonomy. 
A graphical representation is shown in fig- 
ure ||, where the + sign indicates that the 
hypernymy relationship represented by the 
arrow does not need to be immediate. In 
this case, this iteration is only allowed in 
the source taxonomy. 
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Figure 3: ai constraints. 

IA constraints: Are symmetrical to Al con- 
straints. In this case, recursion is allowed 
only on the target taxonomy. 

AA constraints: Include the same combina- 
tions than above, but allowing recursion on 
both sides, as presented in figure ||[ 
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Figure 4: aa constraints. 

3.2 Generalized Constraints 

Although the minimalist approach described 
above yields very good results for the WordNet 
nominal taxonomy using only hyper/hyponymy 
relationships, it becomes much less useful in the 
case of verbs -due to the very flat verb taxon- 
omy in wn- and completely useless for adjec- 
tives and adverbs, which do not have such a 
kind of relationships. 

Thus, we generalized the structural match- 
ing idea to relationships other than hy- 
per/hyponyms. In that way, we can use con- 
straints that involve any WN relationship such 
as antonymy, "also-see", "similar-to" , etc. 

This generalized constraint class has the 
schema presented in figure H, where a connec- 
tion between a source synset si and a target 
synset t\ is reinforced if there are a source synset 
s2 and a target synset t2 such that S1R.S2 and 
t\R,t2i where TZ may be any wn relationship. 
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Figure 5: Generalized constraints. 

This new scenario enables us to map the com- 
plete wn1.5 to wn1.6. Now we can talk about 
source and target graphs, since it is not only hy- 
per/hyponymy what defines the structure which 
constraints the mapping. 

Nevertheless, this graph mapping is per- 
formed incrementally for efficiency reasons, as 



described in section 3.4. 



3.3 Additional Heuristic Constraints 

Apart from the generalized structural con- 



straints presented in section |3.2| , some heuris- 
tic, non-structural constraints were also used to 



help the algorithm decide in some cases in which 
structure was not enough informative. For in- 
stance, the case where a leaf node in WNl.5 may 
map to two different leafs in wn1.6, both under 
the same parent, may be disambiguated success- 
fully with some simple heuristic constraints. 

A rough measure of similarity between two 
synsets can be computed using the number of 
word coincidences, the number of non-empty 
word coincidences in glosses, or -in the case of 
verb synsets- the number of frame coincidences. 

For each of these similarity measures, a non- 
structural constraint can be stated, which re- 
inforces a connection between a source and a 
target synset proportionally to their similarity. 

We denote these constraints with w, G and 
F, depending on whether word, gloss or frame 
coincidences are used to compute similarity. 

3.4 Constraints for wn Verbs, 
Adjectives and Adverbs 

In order to reduce the computation time for the 
mapping, we proceeded in three phases, follow- 
ing the relationships that establish dependences 
among different POS in WordNet, as described 
below: 

• First, we mapped nouns, using only hy- 
per /hyponymy relationships plus w and G 
additional constraints. 

Similarly, verbs were mapped using hy- 
per /hyponymy, antonymy and the "also- 
see" WN relationship, w, G and F addi- 
tional constraints were also used. 

• Second, adjectives were mapped using adj- 
to-adj relationships such as antonymy, 
"similar-to" and "also-see", as well as 
the adj-to-verb relationship "participle- 
of" and the adj-to-noun "pertains" and 
"attribute". w and G additional con- 
straints were also used. 

Notice that the graph used to check the 
constraints imposed by adj-to-verb and 
adj-to-noun relationships was the result of 
the previous steps. That means that verbs 
and nouns are already mapped, reduc- 
ing considerably the search space and ac- 
celerating relaxation labelling convergence. 
Since the noun and verb mappings do not 
depend on adjectives (there are no noun- 
to-adj relationships and only one verb-to- 



adj -not used to avoid circularity-), re- 
sults wouldn't have been greatly affected if 
the mappings had been performed in par- 
allel, but the convergence would have been 
slower. 

• The third and last phase is the adverb 
mapping, which is performed with the 
only adv-to-adv relationship, antonymy 
and with the adv-to-adj "derived" . w and 
G additional constraints were also used. 

Again, the noun, verb, and adjective 
graphs are taken static picture of the 
mapping obtained from previous phases. 

4 Experiments and Results 

In the performed tests we evaluated the perfor- 
mance of different constraint combinations for 
each phase. 

All figures presented in this section were com- 
puted by manually linking to wn1.6 a sample 
chosen from wn1.5, and then use this sample 
mapping as a reference. The validation sample 
consists of 1900 noun synsets, 1000 verb, 1000 
adjective and 300 adverb synsets. 

See ( paude et al., 20QO| ) for a comparison of 
our mapping with the SenseMap provided by 
PrincetonF]. 

Results are presented in tables § and |||. 
They present the results for increasingly com- 
plex constraint sets. 

First, table |] presents results with the basic 
constraint set for each part-of-speech. They 
consist of the hyper/hyponym relationships 
(when available) plus other available intra-POS 
relationships -i.e. relationships between synsets 
with the same POS. Namely, the constraint sets 
are: 

Nouns: AA hyper/hyponym constraint set - 
corresponding to the basic technique used 
in ( paude et al., 2000| ). 

Verbs: AA hyper/hyponym constraint set plus 
"also-see" and antonymy relationships in n 
(immediate-to-immediate) mode. 

Adjectives: Antonymy, "similar-to" and 
"also-see" , all of them in n mode. 

Adverbs: Antonymy in II mode. 



See wn web page at 



http:/ /www. cogsci. princeton. edu/~ wn/ 





Cover. 


ambiguous. 


overall 


N 


99.7% 


94.9%-99.6% 


97.6%-99.8% 


V 


96.9% 


93.5%-99.2% 


94.6%-99.2% 


A 


94.1% 


82.8%-98.9% 


89.5%-99.4% 


R 


80.8% 


97.5%-100.0% 


99.0%-100.0% 



Table 1: Precision-recall results for basic con- 
straint set. 

Second, table || shows the results for the basic 
set extended with the w and G constraints for 
nouns, adjectives and adverbs, and with w, G 
and F for verbs. 

Note that the results for verbs are lower over 
all synsets than over ambiguous ones. This 
is caused because some of the non-ambiguous 
WNl.5 synsets in the validation sample were 
hand marked to have no correspondence in 
WNl.6 (i.e. they were removed in the new 
version), which causes the precision for non- 
ambiguous nodes to be only 99.1%. 





Cover. 


ambiguous. 


overall 


N 


99.9% 


97.5%-97.7% 


98.8%-98.9% 


V 


99.8% 


99.4%-99.7% 


99.3%-99.6% 


A 


98.9% 


96.5%-98.8% 


97.9%-99.3% 


R 


99.5% 


97.5%-100.0% 


99.0%-100.0% 



Table 2: Precision-recall for basic+ wgf con- 
straint set 

Third, table |3] shows the results for the basic 
set extended with available extra-POS relation- 
ships, which are: 

Nouns: No extra-POS relationships. 

Verbs: No extra-POS relationships. 

Adjectives: "participle-of " , "attribute" (adj- 
to-noun) and "pertains-to" (adj-to-verb) , 
both in II mode. 

Adverbs: "derived-from" (adv-to-adj) , in n 
mode. 





Cover. 


ambiguous. 


overall 


N 








V 








A 


95.8% 


85.0%-98.9% 


90.9%-99.4% 


R 


88.0% 


69.2%-94.2% 


87.9%-98.1% 



Finally, Table |] shows the results obtained 
when using together the basic set, plus the WGF 
set, plus the extra-POS set. For nouns and verbs, 
results from table are repeated for clarity. 





Cover. 


ambiguous. 


overall 


N 


99.9% 


97.5%-97.7% 


98.8%-98.9% 


V 


99.8% 


99.4%-99.7% 


99.3%-99.6% 


A 


99.0% 


96.5%-99.1% 


97.9%-99.5% 


R 


99.6% 


98.3%-100.0% 


99.3%-100.0% 



Table 4: Precision-recall for complete con- 
straint set. 

Those figures indicate that in all cases, the 
use of additional constraints other than the ba- 
sic set, produces an increase in coverage. 

For the WGF constraints, this improvement 
is accompanied by a large ambiguity reduction 
(thus a narrower precision-recall interval) and 
an increment in precision, specially in verbs and 
adjectives. Adverbs increase coverage maintain- 
ing precision and recall, while nouns present a 
small improvement. 

The extra-POS constraints turned to be use- 
ful for adjectives, but harmful for adverbs when 
used alone. In both cases, they raised precision 
when used in conjunction with WGF constraints. 

The recall below 100% is caused by synsets 
having several candidates with the same struc- 
tural features. Since all candidates obtain the 
same support from the constraints, the algo- 
rithm cannot chose one or another and leaves 
it ambiguous. If the final support is under the 
threshold, no answer is computed as correct. 

The precision under 100% is caused by two 
kinds of errors: First, some WNl.5 synsets are 
splitted in WNl.6 and the human evaluator may 
consider that the best choice is not the one made 
by the system. Second, some WNl.5 synsets 
simply do not appear in wn1.6, but if the sys- 
tem assigns them anyway, they are computed as 
an error. 

5 Conclusions & Further Work 

We have applied the relaxation labelling algo- 
rithm to build a complete mapping between two 
different WordNet versions, and reported results 
on WNl.5 to WNl.6 complete mapping! 2 ]. 



Table 3: Precision-recall for basic+extra-POS 
constraint set 



2 The 



mapping 



http: / /www. lsi.upc.es/' 
have been provided. 



has been made 
nip. So far, 



public at 
37 licenses 



An incremental approach has been used to 
map separately the different wn POS files, using 
at each step the results of previous ones. 

Non-structural information such as synset 
words, glosses and verb frames has been proven 
useful in increasing coverage, reducing ambigu- 
ity, and improving performance. 
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